Method and apparatus for compression of configuration bitstream of field programmable logic

ABSTRACT

A memory is disclosed that can be utilized with a field programmable gate array. In some embodiments, the memory can include a memory array comprising a plurality of memory banks, each memory bank including at least one memory block, each of the at least one memory block including an array of memory cells; an address decoder coupled to each of the at least one memory block, the address decoder including a comparator coupled to receive an input address and a block address and provide a compare bit that indicates when a portion of the input address matches the block address, and an OR gate coupled to receive the compare bit and a wildcard bit, the OR gate providing an enable to the memory block when either the compare bit or the wildcard bit is asserted; and a logic unit that receives a mode value and the input address and provides the wildcard bit to each of the address decoders. Data can be simultaneously written into the memory array in patterns in accordance with the mode value. For example, in some embodiments the mode value indicates one of four patterns, a normal pattern, a block checkerboard pattern, a bank checkerboard pattern, and an all banks pattern.

BACKGROUND

1. Technical Field

Some embodiments disclosed herein are related to a method and apparatusfor data compression and, in particular, to compression of theconfiguration bitstream of field programmable logic.

2. Discussion of Related Art

Field programmable logic (FPG) is capable of being programmed by theuser (in the “field”) using a configuration bitstream generated by asoftware tool. When the IC containing field programmable logic is usedin a system, it utilizes a non-volatile memory to store theconfiguration bitstream. This non-volatile memory may be a separatememory chip, or it may be incorporated into the FPG chip. In eithercase, the size of the memory utilized in the system is related to thesize of the expected bitstream and impacts the bill of materials (BOM)of the system.

In the case of on-chip non-volatile memory, there may be limits to thesize of the memory that can be economically used. Therefore, the size ofthe configuration bitstream could determine the overall feasibilty ofthe product. The size of the configuration bitstream also impacts thespeed at which the FPG can be programmed.

The speed of programming and memory sizes are also important in thecontext of silicon testing of the FPG IC because programmable logic, bynature, utilizes tens of bitstreams for sufficient testing coverage. Thecost of a silicon test on an Automatic Test Equipment (ATE), or similarequipment, is highly sensitive to the test time and memory requirements,both of which are primarily set by the size of the configurationbitstream. A field programmable logic IC with a smaller bitstream willhave a cost advantage over other systems.

Therefore, there is a need for a method of compressing and decoding datafor utilization, for example, in an FPGA that reduces the size of theinput data and decreases the overall write time.

SUMMARY

In accordance with some embodiments of the present invention a memorysystem is presented. A memory according to some embodiments of thepresent invention can include a memory array comprising a plurality ofmemory banks, each memory bank including at least one memory block, eachof the at least one memory block including an array of memory cells; anaddress decoder coupled to each of the at least one memory block, theaddress decoder including a comparator coupled to receive an inputaddress and a block address and provide a compare bit that indicateswhen a portion of the input address matches the block address, and an ORgate coupled to receive the compare bit and a wildcard bit, the OR gateproviding an enable to the memory block when either the compare bit orthe wildcard bit is asserted; and a logic unit that receives a modevalue and the input address and provides the wildcard bit to each of theaddress decoders. Data can be simultaneously written into the memoryarray in patterns in accordance with the mode value. For example, insome embodiments the mode value indicates one of four patterns, a normalpattern, a block checkerboard pattern, a bank checkerboard pattern, andan all banks pattern.

In some embodiments, a method of writing to a memory array includesreceiving a data value, an input address, and a mode value; andsimultaneously writing the data value into memory cells of the memoryarray in accordance with the input address and the mode value. In someembodiments, the mode value determines a pattern from a set of patternsfor writing into the memory array.

These and other embodiments are further disclosed below with referenceto the following drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a block diagram of a FPGA system consistent with someembodiments of the present invention.

FIG. 2 illustrates a layout of a configuration memory array consistentwith some embodiments of the present invention.

FIG. 3 shows a memory address consistent with some embodiments of thepresent invention.

FIG. 4 shows a table defining a set of modes consistent with someembodiments of the present invention.

FIGS. 5-8 show a portion of a memory array illustrating operation of themodes shown in FIG. 4.

FIGS. 9A-9C illustrate compression of data exhibiting some examplepatterns to reduce the size of a configuration bitstream consistent withsome embodiments of the present invention.

FIG. 10 shows a block diagram of a decoder consistent with someembodiments of the present invention.

FIG. 11 shows a logic table of the decoding logic unit shown in FIG. 10according to some embodiments of the present invention.

FIG. 12 illustrates a process for writing data into a configurationmemory according to some embodiments of the present invention.

DETAILED DESCRIPTION

In the following description specific details are set forth describingthe embodiments disclosed herein. It will be apparent, however, to oneskilled in the art that some embodiments may be practiced without someor all of these specific details. The specific embodiments disclosedherein are meant to be illustrative but not limiting. One skilled in theart may realize other material that, although not specifically describedherein, is within the scope and spirit of this disclosure.

For overall design optimization, bitstream compression is one factoramongst many (e.g., frequency and power). Often, bitstream compressionis balanced against other factors to optimize a particular system. Someembodiments of the present invention can deliver significant cost andperformance benefits by 1) reducing the size of the non-volatile memoryto store the configuration bitstream, 2) reducing the time forperformance of silicon tests of the field programmable gate array(FPGA); and 3) reducing the time to reconfigure the FPGA. The ability touse a smaller bitstream delivers a cost advantage to products,especially when the memory is on-chip and relatively more expensive.Further, reducing the time for configuring the FPGA and for testing thesystem can produce significant performance advantages.

FIG. 1 shows a block diagram of a FPGA system 100 consistent with someembodiments of the present invention. As shown in FIG. 1, FPGA system100 includes a memory 140 and a FPGA 130. FPGA 130 itself includes aconfiguration memory 120. As is well known, the programmable gate arraysin FPGA 130 are configured by writing to configuration memory 120.Therefore, data written into configuration memory 120 determines thelogic functions performed by corresponding sections of FPGA 130.Further, as is often the case, identical logic functions may beperformed in various portions of FPGA 130. Therefore, the data writteninto configuration memory 120 may be patterned with identical datawritten into different areas of configuration memory 120, correspondingwith the identical logic functions in corresponding areas of FPGA 130.

Configuration memory 120 is typically a volatile memory. Therefore, asshown in FIG. 1, a configuration bitstream 150 can be transmitted to anon-volatile memory 140 by a processor 110. Configuration bitstream 150can then be stored in non-volatile memory 140 and the resulting dataloaded into configuration memory 120 upon start-up or upon areconfiguration user command. Non-volatile memory 140, for example, canbe any memory that provides permanent storage such as FLASH or EEPROMmemories. The size of memory 140 is dependent on the size ofconfiguration bitstream 150. Reducing the size of configurationbitstream 150, then, reduces the size of memory 140. Further, reducingthe size of configuration bitstream 150 results in a reduction in theload time to write configuration bitstream 150 into memory 140 as wellas memory 120. In the case of a silicon test, where multipleconfiguration bit streams are utilized in comprehensive testing of FPGA130, a reduction in the time to configure FPGA 130 can result insignificant cost savings. Further, when memory requirements for testingare large, more expensive testing equipment is needed to perform thetest.

On power-up or in response to a user command, FPGA 130 can receive andload configuration bitstream 150 that has been stored in memory 140 intoconfiguration memory 120. As shown in FIG. 1, configuration bitstreamincludes an address ADD, data DAT, and a mode value MOD. In accordancewith some embodiments of the present invention, a mode value is providedso that memory array 120 can be written in patterned form. The mode busreceives a mode value MOD which is used in conjunction with inputaddress ADD to determine the memory cells selected for writing data DATreceived at a data bus. When the memory chip receives certain modevalues MOD, memory 120 may treat a portion of the address ADD as apattern indicator used to enable predetermined portions of the memoryarray and treat another portion of the address as sub-address to selecta memory cell within each enabled portion. In some modes, configurationmemory 120 may also disregard a portion of address ADD. In that fashion,identical data DAT can be simultaneously written in a pattern onconfiguration memory 120.

Therefore, as illustrated in FIG. 1, Processor 110 generatesconfiguration bitstream 150 and transmits configuration bitstream 150 tomemory 140 for storage. Configuration bitstream 150 includes the address(ADD), the data (DAT), and the mode value (MOD). Processor 110 performspattern identification and generates configuration bitstream 150 with,in some embodiments, much reduced size, which may utilize a much smallermemory 140 for storage. Configuration memory 120 receives the reducedsized configuration bitstream 150 and, from the address ADD and modevalue MOD, writes data into configuration memory 120 appropriately toprogram FPGA 130.

As discussed above, with the added mode function, data DAT may bewritten into multiple memory cells throughout memory 120 with the samesub-address simultaneously according to the pattern specified by thepattern indicator portion of the address ADD. As a result, data createdfor system 100 may be compressed by grouping cells with identical datavalue into patterns corresponding to one of the modes and writing thedata DAT into multiple memory cells simultaneously. Therefore, an inputdata can be compressed based on commonality of data values with respectto their location in memory 120.

As shown in FIG. 1, configuration bitstream 150, which also includes themode, can be provided by processor 110. Software algorithms can beexecuted on processor 110 for generating compressed data from anoriginal data set that describes the logic functions performed by FPGA130. The software algorithms can create greater commonality of datavalues and achieve higher compression rate, in some cases vastlyreducing the size of configuration bitstream 150.

Processor 110 can be any device capable of executing the softwarealgorithm that generates configuration bitstream 150. As such, processor110 may include memory and other data storage media and may includeprocessors for executing software stored in that memory or on other datastorage media.

FIG. 2 illustrates a layout of configuration memory 120 according tosome embodiments of the present invention. As shown in FIG. 2, memory120 includes a memory array 200. Memory array 200 includes a pluralityof memory banks 230, which can be arranged by row and column. Each ofmemory banks 230 includes an array of memory blocks 220. In theembodiment shown in FIG. 2, memory banks 230 each include a 2 by 2 arrayof memory blocks 220, which are labeled NW, NE, SW, and SE. Each ofmemory blocks 220 includes a plurality of individual memory cells (notshown). The addressing of memory array 200 is described with respect toFIG. 3.

The exemplary layout of memory 120 shown in FIG. 2 is an illustrativeexample only. Some embodiments may not separate the banks 230 intoblocks 220, and patterns may be defined on the block level. A memorybank 230 may consist of any number of memory blocks 220 arranged in avariety of configurations, such as, for example, 1 by 4, 2 by 4, 4 by 4,2 by 8 . . . etc. Memory blocks 220 that comprise a memory bank 230 arealso not necessarily immediately adjacent to each other, nor include thesame number of memory cells, but instead may be dispersed throughoutmemory 120 and be of arbitrary size. However, the memory layoutillustrated in FIG. 2 provides a convenient example from which todescribe aspects of embodiments of the present invention.

FIG. 3 illustrates a memory address string 300 indicating an addressADD, which is part of the configuration data stream, consistent withsome embodiments of the present invention. As shown in FIG. 3, memoryaddress string 300 includes the following fields: a bank row select 302,a bank column select 204, a block select 306, a column select 308, andan intra-block location 310. Column select 308 and intra-block location310 together form a cell address field. In the particular example shownin FIG. 3, memory address string 300 includes a 22 bit length, labeledbits 21:0. As indicated, bank row select 302 is bits 21:18, bank columnselect 304 is bits 17:14, block select 306 is bits 13:12, column select308 is bits 11:9; and intra-block location 310 is bits 8:0. However,memory address string 300 may be of any bit size and, additionally, bankrow select 302, bank column select 304, block select 306, column select308, and intra-block location 310 can be of any size. In someembodiments, memory address string 300 is accompanied by data DAT to bewritten into the address ADD indicated in memory address string 300, orin some embodiments a series of data DAT that will be sequentiallywritten into memory array 200 starting from the address ADD indicated bymemory address string 300.

In memory array 200, memory banks 230, which are arranged in rows andcolumns, are identified by bank row select 302, e.g., bits 21:18, andbank column select 304, e.g., bits 17:14. Block select 306, e.g., bits13:12, identifies one of blocks 220 in FIG. 2, e.g. one of the fourblocks NW, SW, NE, and SE in bank 230 indentified by bank row select 302and bank column select 304. Internal to block 220, memory cells can bearranged in column and row order as well. Column select 308, e.g., bits11:9 in FIG. 3, identify one of the columns in the identified block 220.Intra-block location address 310, e.g. bits 8:0, identifies a specificcell in a memory block 220 by identifying the row of cells in block 220.In some embodiments, the cell address may utilize a different addressingmethod than that shown in FIG. 3.

As indicated above, the address configuration shown in FIG. 3 is anillustrative example. The length of the address may vary depending onthe size of memory array 200. The size of each of the portions may varydepending on the grouping of the memory cells in the memory array. Forexample, the block select portion may include 3 bits if there are eightblocks in each bank. The address may also consist of fewer layers ofaddress, such as including only block select and an intra-block locationportions, for example. The address may also include additional layers ofaddress such as having an additional array select portion when a singlememory chip comprises multiple memory arrays.

In accordance with some embodiments of the present invention, as shownin FIG. 1, a mode signal can be provided to memory 120 in order toreduce the size of the configuration bitstream that needs to betransmitted to memory 120. The configuration of FPGA array 130, forexample, often occurs in patterns that can be mapped onto memory 120.The mode signal can be utilized to simultaneously write various patternsinto memory 120, which, instead of including an individual write to eachmemory cell, includes a limited number of bits in configurationbitstream written simultaneously to multiple memory cells in thepattern.

FIG. 4 shows a table illustrating a set of modes consistent with someembodiments of the present invention. In the embodiment shown in FIG. 4,the mode value has two bits to accommodate four modes: ‘00,’ ‘01,’ ‘10,’and ‘11.’ As further shown in FIG. 4, mode value ‘00’ indicates a“normal” mode, mode value ‘01’ indicates a “block checkerboard” mode,mode value ‘10’ indicates a “bank checkerboard mode,” and mode ‘11’indicates an “all banks” mode.

In normal mode (mode value ‘00’), only a memory cell having an addressmatching the input address ADD is written with data DAT. FIG. 5illustrates normal mode where, in reversed color, a memory block 500when the address ADD identified in memory address 300 identifies bankcolumn 1, bank row 1, block NW, and the particular cells within block500 identified by the cell address, which includes column select 308 andintra-block location 310.

As discussed above, when the mode value is 01, the mode is “blockcheckerboard.” This mode is illustrated in FIG. 6. FIG. 6 shows thateither blocks 600 or blocks 602 are written. A pattern indicator bit,for example address bit 12, can be utilized to determine which of blocks600 or blocks 602 are written. For example, if the pattern bit is set to0 blocks 600 can be written into, whereas if the pattern bit is set to1, blocks 602 can be written into. As shown in FIG. 6, the blocksselected to receive data DAT under this mode forms an alternating, orcheckerboard, pattern on memory array 200. In block checkerboard mode,bank row select 302, bank column select 304, and block select 306 areignored. As indicated above, one bit, in the example here one bit ofblock select 306, is utilized as the pattern indicator bit. For example,in the particular example shown in FIG. 3, address bits 21:13 areignored and bit 12 is designated as the pattern indicator bit.Specifically, if address bit 12 is 0, all SW and NE blocks (blocks 600)in each of memory banks 230 are written into at the cell indicated bycolumn select 308 and intra-block location 310. If address bit 12 is 1,then all SE and NW blocks (blocks 602) are written into at the cellindicated by column select 308 and intra-block location 310.

When the mode value is 10 the mode is “bank checkerboard.” This mode isillustrated in FIG. 7. In bank checkerboard mode, one block in each ofeither the odd or even numbered banks is written. The odd numbered banksare those banks where the sum of the column and the row addresses isodd. The even numbered banks are those banks where the sum of the columnand the row addresses is even.

FIG. 7 shows blocks 700, the NW block of even number banks, as beingwritten. Bank row select 302 and bank column select 304 are ignored andone of the ignored bits can be utilized as a pattern indicator bit. Forexample, address bits 21:15 are ignored and bit 14 designated as thepattern indicator bit. In some embodiments, if the pattern indicator bitis 0, then all banks at even locations (the sum of the row number andthe column number is even) are written with the data DAT at the locationindicated by block select 306, column select 308, and intra-blocklocation 310 (e.g., address bits 13:0) to indicate the individual cells.If the pattern indicator bit is 1, then all banks at odd locations (thesum of the row number and the column number is odd) are written with thedata DAT at locations indicated by block select 306, column select 308,and intra-block location 310 (e.g., address bits 13:0). The banksselected to received data DAT under this mode forms an alternatingpattern on the memory array.

When the mode value is 11, the mode is “all banks” mode. The all banksmode is illustrated in FIG. 8, where blocks 800 in every one of banks230 are written. In this case, bank row select 302 and bank columnselect 304 (e.g. address bits 21:14 shown in FIG. 3) are ignored. In theall banks mode, all memory cells in each of banks 230 with the indicatedblock select 304, column select 308, and intra-block location 310address are written with data DAT

The modes illustrated in FIG. 4 are illustrative examples only. The modebus may consist of more or fewer bits allowing more or fewer patterns tobe utilized. The modes may also utilize different portions of the inputaddress ADD as pattern indicator bits, or use more bits to indicateparticular patterns. The patterns determined by the mode MOD and patternindicator portion of the address ADD may also vary without departingfrom the spirit of the present embodiment. For example, address bit 12having the value 0 may select even or odd block columns or even or oddblock rows instead of selecting a checkerboard pattern. Additionally, adifferent set of patterns may be chosen.

System 100 shown in FIG. 1 allows configuration bitstream 150 (whichincludes both address ADD 300 and data DAT) to be reduced in sizerelative to the amount of data written into memory array 200. FIG. 9 A-Cillustrate various patterns that can be formed utilizing the modes shownin FIG. 4, memory array 200 shown in FIG. 2, and the address ADD shownin FIG. 3.

The written pattern illustrated in FIG. 9A, with data A shown writteninto the SE block of each of banks 230, can be written by setting themode MOD to ‘11,’ setting block select to the SE block, and writing thedata A into memory 120. Under those conditions, data A is simultaneouslywritten into each of the memory cells indicated by column select 308 andintra-block location 310 in all of SE blocks in each of banks 230 inmemory array 200.

The written pattern illustrated in FIG. 9B includes data A and data Bwritten bank checkerboard pattern in the SE block. This pattern can bewritten in two steps. In step 1, the mode MOD is set to ‘10,’ thepattern indicator bit is set to select the odd number, setting blockselect 308 to the SE block, and writing data A into the locationindicated by column select 308 and intra-block location 310. In step 2,the mode MOD is set to ‘10,’ the pattern indicator bit is set to selectthe even numbered banks, setting the block select 308 to the SE block,and writing data B into the location indicated by column select 308 andintra-block location 310. Alternatively, the pattern shown in FIG. 9Bcan be written by writing either data A or data B into all SE blocksutilizing the all banks mode and then utilizing the bank checkerboardmode to write the data that was not written in the first step.

The written pattern illustrated in FIG. 9C shows data A, data B, anddata C written into the SE block of each of banks 230. To achieve thedata pattern illustrated in FIG. 9C, the compressed input data may bewritten in four steps. In step 1, mode value MOD may be set to ‘11’ toselect the all banks mode and block select 306 may be set to choose theSE block. Data A can then be written into the memory cells identified bycolumn select 308 and intra-block location 310 into every SE block ofevery bank 230, as is shown in FIG. 9A. In step 2, the mode value MODmay be set to ‘10’ to write in bank checkerboard mode, the patternindicator bit set to choose even locations (e.g., setting bit 14 to 0),and block select 306 set to choose the SE block. Data B is then writteninto the memory cells indicated by column select 308 and intra-blocklocation 310 into the pattern shown in FIG. 9B. In a third step, themode value MOD is set to ‘00,’ normal mode, and bank row select 302,bank column select 304, and block select 306 set to choose the SE blockof the bank located at column 1, row 0. Data C is then written into thecell location of that bank indicated by column select 308 andintra-block location 310. In the fourth step, the mode value MOD is setto ‘00,’ normal mode, and bank row select 302, bank column select 304,and block select 306 set to choose the SE block of the bank located atcolumn 2, row 1. Data C is then written into the cell location of thatbank indicated by column select 308 and intra-block location 310.

As illustrated above, the configuration of system 100 can be performedwith a much compressed data stream relative to writing data individuallyinto the memory array 200. Furthermore, compared to convention methods,in which data strings are serially written into the memory array, system100 according to some embodiments has the ability to simultaneouslywrite the same data into multiple memory cells, considerably shorteningthe total write time.

FIG. 10 shows a block diagram of a decoder 1000 for a memory block 220consistent with some embodiments of the present invention. In general,the mode values can be utilized to form wildcard bits so that an addressdecoder with a wildcard bit can be utilized. As shown in FIG. 10, anaddress comparator 1001 provides a logic bit to OR gate 1003. Addresscomparator can receive bank row select 302, bank column select 304, andblock select 306 and compare that address with the block addressidentifying the particular block 220. Additionally, a wildcard bit froma decode logic unit 1005 is also provided to OR gate 1003. In thatfashion, a memory block is enabled when the block address is equal tothe address applied to the memory or when the wildcard bit is enabled.Conventionally, a memory block would be enabled only when the addressesmatch.

In the embodiment shown in FIG. 10, memory block decoder 1000 includesdecoding logic unit 1005. Decoding logic unit 1005 receives the modevalue MOD and determines the value of the wildcard bit to OR gate 1003accordingly. FIG. 11 shows a logic table executed by decoding logic unit1005 that is consistent with the mode values outlined in FIG. 4 and theblock layout illustrated in FIG. 2. The decoding logic unit 1005 usesthe values of the mode MOD to determine whether the individual block 220should be enabled in accordance with the pattern identified by the modevalue. As shown in FIG. 11, mode bits 1 and 2 and the pattern indicatorbit (either bit 12 or 14) result in a different logic output in each ofthe eight blocks (even or odd plus NW, NE, SW, and SE).

In the table shown in FIG. 11, blocks NW, NE, SW, and SE can beidentified by block select values 00, 01, 10, and 11, respectively. Thelogic output of decoding logic unit 1005 for the particular one ofmemory block 220 is indicated by the type of memory block. Depending onthe mode MOD and the input address ADD, the decoding logic unit may senda pattern enable signal to enable none, one, two, or four block groups.For example, when mode is ‘01’ and bit 12 of the input address ADD is 0,enable signals are sent to block groups odd-bank SW, odd-bank NE,even-bank SW, and even-bank NE. Also, when mode is 11 and block selectof the input address is NW, then enable signals are sent to block groupsodd-bank NW and even-bank NW. In some embodiments, decoding logic unit1005 may be coupled to either pattern enable lines coupled to providewildcard enable signals to decoder circuits coupled to each of thememory blocks 220. In some embodiments, each of memory blocks 220includes a decoding logic unit 1005 providing a wildcard enable signal.

The logic table shown in FIG. 11 is an illustrative example. Other logictables and pattern enable line configurations are possible to implementthe modes listed in FIG. 4. For example, each of the banks could have anenable line, and the six, instead of eight pattern enable lines could beutilized: odd and even banks enable, and NW, SW, SE, and NE blocksenable. The logic table may also vary depending on the modes and thepatterns designs of different embodiments. Further, in some embodimentsone decode logic unit 1005 may be coupled to each of the eight blocktypes so that decode logic unit 1005 provides eight different wildcardvalues, one for each of the eight block types.

As discussed above, some embodiments of the present invention allowinput data DAT to be transmitted in configuration bitstream incompressed form. A software algorithm may optimize the uncompressedinput data mapping according to the patterns of the predetermined modesto minimize the size of the compressed input data. As long as thegranularity of the input data is reasonably large, the overhead ofinserting mode values into the compressed data is insignificant. Thedecode and write time of the disclosed embodiments is also considerablyshortened because multiple blocks are written into simultaneously.

FIG. 12 illustrates a process 1200 to write data into a memory 120according to some embodiments of the present invention. As shown in FIG.12, first the configuration is determined. In this fashion, data that isto be written into memory 120 is determined. In some embodiments, thedata may be compiled with a determination of what data DAT is to bewritten into which memory locations in memory 120. In step 1204,patterns from the list of available modes are determined. In someembodiments, each mode pattern is tested against the compiled data todetermine a set of pattern writes that will accomplish the dataconfiguration determined in step 1202. In step 1206, each of the modepatterns may be tested. In step 1208, configuration bitstream 150 isstored in memory 140 and, from there, data DAT is loaded into memory 120as described above.

Although embodiments according to the following invention can beutilized for other purposes, especially where data is likely to bewritten into patterns in memory 120, one implementation that benefitsfrom the advantages of some embodiments according to the presentinvention is a configuration memory array in a field programmable gatearray (FPGA) devices, which is shown in FIG. 1. FPGAs generally containa configuration memory which stores a user configured bitstream. Whenthe configuration memory is a volatile memory, the bitstream may beloaded through software before each session, or the bitstream may bestored on another non-volatile memory on the FPGA device, such as memory140. Increased size of the non-volatile memory increases cost,therefore,. although a large bitstream size is often preferred, themaximum allowable size of the bitstream influences the economicalfeasibility of the device. The disclosed embodiments allows anon-volatile memory to store a compressed bitstream that is larger thanthe memory's maximum capacity when decoded, thereby reducing the costassociated with a larger non-volatile memory. Some embodiments alsoreduce the load time when bitstream 150 is transferred from non-volatilememory 140 to configuration memory 120.

Some embodiments of the present invention also confer advantages in thesilicon testing of a field programmable logic. During FPGA testing, tensof bitstreams are loaded into configuration memory (volatile ornon-volatile) in order to achieve sufficient reasonable test coverage.The cost of testing is affected by the test time and the memoryrequirement, both of which are effected by the size of input bitstream150. Compressing input bitstream 150 and utilizing simultaneous writesin memory 120 can significantly reduce the transfer time and therebyreduce the time and cost of testing.

The embodiments of the invention described here are exemplary only andare not intended to limit the scope of the invention. One skilled in theart may recognize various modifications to the embodiments specificallydescribed. These modifications are intended to be within the scope ofthis disclosure. As a result, the invention is limited only by thefollowing claims.

1. A memory, comprising: a memory array including a plurality of memorybanks, each memory bank including at least one memory block, each of theat least one memory block including an array of memory cells; an addressdecoder coupled to each of the at least one memory block, the addressdecoder including a comparator coupled to receive an input address and ablock address and provide a compare bit that indicates when a portion ofthe input address matches the block address, and an OR gate coupled toreceive the compare bit and a wildcard bit, the OR gate providing anenable to the memory block when either the compare bit or the wildcardbit is asserted; and a logic unit that receives a mode value and theinput address and provides the wildcard bit to each of the addressdecoders, wherein data can be simultaneously written into the memoryarray in patterns in accordance with the mode value.
 2. The memory ofclaim 1, wherein the portion of the input address and the block addresseach include a bank row select, a bank column select, and a block selectfield, wherein the bank row select determines a row location of aparticular memory bank, the bank column select determines a columnlocation of the particular memory bank, and the block select determinesa particular memory block within the particular memory bank.
 3. Thememory of claim 2, wherein the input address further includes a celladdress to determine a particular cell within the particular memoryblock.
 4. The memory of claim 3, wherein the cell address includes acolumn select and an intra-block location.
 5. The memory of claim 3,wherein the at least one memory block includes four blocks arranged in a2×2 array, the 2×2 array of blocks being designated NW, NE, SW, and SE.6. The memory of claim 5, wherein the mode value indicates one of aplurality of modes indicating a pattern of simultaneous writes.
 7. Thememory of claim 6, wherein the mode value indicates one of fourpatterns, a normal pattern, a block checkerboard pattern, a bankcheckerboard pattern, and an all banks pattern.
 8. The memory of claim7, wherein in normal mode data is written into memory locationsidentified by the input address.
 9. The memory of claim 7, wherein inblock checkerboard mode, the bank row select, the bank column select,and the block select are ignored, a bit from one of the bank row select,the bank column select, and the block select is designated as a patternindicator bit, and wherein either the SW and NE blocks are written to orthe SE and NW blocks are written to depending on the pattern indicatorbit.
 10. The memory of claim 7, wherein in bank checkerboard mode, thebank row select and bank column select are ignored and a bit from thebank row select and the bank column select is designated as a patternindicator bit, and wherein either all banks with even designations orall banks with odd designations are written to at location determined bythe block select and the cell address, depending on the patternindicator bit.
 11. The memory of claim 7, wherein in all banks mode, thebank row select and bank column select are ignored and data are writteninto all locations identified by the block select and the cell address.12. A method of writing to a memory array, comprising: receiving a datavalue, an input address, and a mode value; and simultaneously writingthe data value into memory cells of the memory array in patternsaccording to the mode value and the input address; wherein for at leastone mode value, the input address is interpreted not as specifying asingle memory cell but as specifying a plurality of memory cells atdifferent addresses, for writing the data value into each of the memorycells at the different addresses.
 13. A method of writing to a memoryarray, comprising: receiving a data value, an input address, and a modevalue; and simultaneously writing the data value into memory cells ofthe memory array in patterns according to the mode value and the inputaddress; wherein the memory array includes a plurality of memory banksand at least one memory block within each of the plurality of memorybanks, the input address includes a bank row select, a bank columnselect, a block select, and a cell location, wherein the bank row selectand the bank column select identify a particular memory bank within theplurality of memory banks, the block select determines a particularmemory block within the particular memory bank, and the cell locationdetermines a cell within the particular memory block.
 14. The method ofclaim 13, wherein the mode value determines a pattern from a set ofpatterns for writing into the memory array.
 15. The method of claim 14,wherein the at least one memory bank includes four memory blocks, andthe mode value includes four modes.
 16. The method of claim 15, whereinthe four modes includes a normal mode where data is written into memorylocations identified by the input address.
 17. The method of claim 15,wherein the four modes includes a block checkerboard mode, where thebank row select, the bank column select, and the block select areignored, a bit from one of the bank row select, the bank column select,and the block select is designated as a pattern indicator bit, andwherein either the SW and NE blocks are written to or the SE and NWblocks are written to depending on the pattern indicator bit.
 18. Themethod of claim 15, wherein the four modes includes a bank checkerboardmode, where the bank row select and bank column select are ignored and abit from the bank row select and the bank column select is designated asa pattern indicator bit, and wherein either all banks with evendesignations or all banks with odd designations are written to atlocation determined by the block select and the cell address, dependingon the pattern indicator bit.
 19. The method of claim 15, wherein thefour modes includes an all banks mode, where the bank row select andbank column select are ignored and data are written into all locationsidentified by the block select and the cell address.
 20. A method ofconfiguring a field programmable gate array, comprising: compiling aconfiguration of the field programmable gate array; determining one ormore patterns corresponding to a plurality of modes from theconfiguration; determining a configuration bitstream utilizing thepatterns; and writing the configuration into a memory of the fieldprogrammable gate array according to the one or more patterns.
 21. Themethod of claim 20, further including storing the configurationbitstream in a non-volatile memory.
 22. The method of claim 20, whereinthe memory is partitioned into banks, with each bank including one ormore blocks of individual cells and the plurality of modes includes anormal mode, a block checkerboard mode, a bank checkerboard mode, and anall banks mode.
 23. The method of claim 20, wherein writing theconfiguration into memory includes receiving the configurationbitstream, determining a mode value, and address, and data from theconfiguration bitstream; writing the data into the memory in a patterndetermined by the mode value, at cell locations determined by theaddress.
 24. A memory comprising: a plurality of memory cells; circuitryfor receiving a data value, an input address, and a mode value, andsimultaneously writing the data value into each of one or more memorycells specified by the mode value and the input address; wherein theinput address is interpreted in accordance with the mode value tospecify the one or more memory cells into which the data value iswritten; and for at least one mode value, the input address isinterpreted not as specifying a single memory cell but as specifying aplurality of memory cells at different addresses.
 25. The memory ofclaim 24 wherein for at least one mode value, the input address isinterpreted as specifying a single memory cell.
 26. The memory of claim24 wherein for said at least one mode value, the input address isinterpreted as specifying a repeating pattern of memory cells whichrepeats throughout the plurality of memory cells.
 27. The memory ofclaim 26 wherein: the memory comprises a plurality of memory blocks,each memory block comprising a plurality of locations having respectivedifferent addresses, each location comprising one or more of said memorycells; and the repeating pattern is defined by a repeating pattern ofthe memory blocks.